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Abstract 

We propose a quantitative approach for calibrating and validating key features of traf- 
fic instabilities based on speed time series obtained from aggregated data of a series 
of neighboring stationary detectors. We apply the proposed criteria to historic traffic 
databases of several freeways in Germany containing about 400 occurrences of con- 
gestions thereby providing a reference for model calibration and quality assessment 
with respect to the spatiotemporal dynamics. First tests with microscopic and macro- 
scopic models indicate that the criteria are both robust and discriminative, i.e., clearly 
distinguishes between models of higher and lower predictive power. 
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1. Introduction 

Traffic flow models displayin g instabili t ies and stop-and-go waves have been pro- 
po sed for more than fifty years dReuschell [1950: Pipea fl953l) (see also the reviews 



bv lHelbing ( 2001 ); Hoogendoorn and Bovvl d200lh). Obs ervations of instabilities date 



back several decades as well ( Trei terer and Mversl 1 1 974f) . Nevertheless, a systematic 



concept for quantitatively calibrating and validating model predictions of instabilities 
against observed data seems still to be missing. A possible reason for this are intrica- 
cies in the data interpretation and wrong expectations about what can be achieved and 
what not. 

In principle, traffic instabilities and oscillations in congested traffic can be captured 
by several types and representations of data: flow-density points and speed time se- 
ries of aggregated stationary detector data (SDD), passage times and individual speeds 
from single-vehicle SDD, floating-car data captured by the vehicles themselves, and 
trajectory data derived from images of cameras at high viewpoints. 
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Figure 1 : (a) Comparison of empirical and simulated flow-density data; (b) simulated ground truth for the 
density with the modeled fundamental diagram (solid curve). 



At first sight, flow-density data seem to be useful for representing traffic instabil- 
ities since oscillations are non-stationary phenomena in which traffic is clearly out of 
equilib rium. This should lead to a wide scattering in the flow-density data of congested 
traffic ( [Kerner and Rehbornll 19961) as shown in Fig.Q]regardless of the theoretical ques- 
tion w hether traffic flow has a definite equ i librium relation ("fun damental diagram"), 
or not dNishinari et all 120031 Ikernerl 120041 ITreiber et all | 2010al). 

In fact, simulations with the Intelligent Driver Model dTreiber et alll200oh . as well 
as with a wide variety of other models, show wide scattering even for identical drivers 
and vehicles (Fig. Q]). Nevertheless, flow-density data cannot be used for a quantita- 
tive calibration since other effects such as inter-driver variability (heterogeneous traf- 
fic) dBanksl 1 19991: bssen et all l2007t ITreiber and Helbind] 199% infra-driver variabil- 
ity (drivers change their behavior over tim et dTreiber and Helbingj, 2003), and many ef- 
fects related to lane changes and platoons dNishinari et all 120031 : ITreiber et alll2006bl) 
may lead to wide scattering as well. 

Moreover, since the density is not directly measurable by stationary detectors, it 
must be estimated, typically by making use of the hydrodynamic relation "flow di- 
vided by the average velocity". The available temporal average, however, systemati- 
cally underestimates the true spatial average relevant for this relation which results in an 
extre me bias in the flow-density data of congested traffic dLeutzbachl 1 1 9881: iHelbingl 



/UUl). fugure yj illustrates this by means ot simulations with the Intelligent Driver 
Model by ITreiber et al. (2000)). This model has been calibrated such that the aggre- 
gated virtual detector shows a similar scattering as the empirical data (Fig.QJa)). Plot- 
ting the same data points using the true spatial density (which, of course, is available 
in the simulation) reveals the systematic errors (Fig.QJb)). 

Alternatively, detailed information can be extracted from single-vehicle data, floating- 
car data, or trajectory data. However, since oscillations are collective phenomena, 
the high level of detail is not necessary. Moreover, to date, the availability of these 
data types is still limited, o r the data are not suitable for our purpo ses. Specifically, 
the NGSIM trajectory data dUS Department of Transportation! 2007 ) widely used as a 
testbed for these types of data cover road sections of typically 2 km in length while 
stop-and-go waves develop on typical scales of 5-10km. 

In contrast, speed time series obtained from aggregated stationary detector data are 
widely available (see Fig. [2]) and thus suitable for systematic calibration and bench- 
marking purposes. Moreover, since temporal averages are the appropriate mean when 
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www.traffic-states.co m 




Figure 2: Screenshot of a publicly available image database of congestions on the German freeway 
A5. The database can be searched by criteria such as direction of travel, space interval, time 
interval, weekday, cause of the congestion, or the type of traffic pattern. In the example, the 
database has been filtered for jams occurring on Mondays in the southe rn driving direction. T he 
images have been produced using a dedicated reconstruction technique ( ITreiber et alluOlObl) , 



analyzing time series, no systematic errors arise. Using this type of data, the wave- 
le ngth and propag a tion velocities of oscil l ations on freeways has b een investigated 

bvlLindgrenld2005l):lBertini and Leall d2005l) : lAhn and Cassidvl (f2007); Schonhof and Helbing 
d2007l) : IWilsonl (12008bl) and compared for several countries by Ziel ke et alj ([2008). 
However, the proposed schemes are not suitable for a systematic automated analysis. 
Moreover, to our knowledge, there seem to be no published work quantitatively deter- 
mining the growth rate of the oscillations. 

In this paper, we propose a systematic approach for calibrating and validating key 
features of traffic instabilities based on speed time series obtained from aggregated 
stationary detector data. This includes the propagation velocity, the wavelength, and 
the growth rate of the oscillations. We give first results based on a historic congestion 
database of the German freeway A5, and show that the method has a high discrimina- 
tive power in assessing the model quality. 

The proposed criteria are described in Sec. [2] and applied to observed data in Sec. [3] 
A first application to traffic flow models is given in Sec.|4]before Sec.[5]concludes with 
a discussion and an outlook. 



2. The Proposed Quality Measures 

The formulation of the cr iteria is based on qualitative empirical findings ("stylized 
facts") (ITreiber et all l2010al) that are persistently observed on various fr eeways all 



over t he world, e.g., the USA, Great Britain, the Netherl ands, and Germany (|Lin dgren, 



2005; Bertini and Leal, 2005; Ahn and Cassidv, 200 
Zielke et all 120081; iHelbing et all 120091: IWilsonl I2OO8 
stylized facts are the following: 



Schonhof and He lbing, 



2007 



For our purposes, the relevant 



1 . Congestion patterns are typically caused by bottlenecks which may be caused, 
e.g., by intersections, accidents, or gradients. Analyzing about 400 congestion 
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patterns on the German freeways A5-North and A5-South (Fig. [2]) did not bring 

concl usive evidence of a single breakdown without a bottleneck dSchonhof and Hel bing, 



2007). 



The congestion pattern can be localized or spatially extended. Localized pat- 
terns, as well as the downstream boundary of extended congestions, are either 
pinned at the bottleneck, or moving upstream at a characteristic velocity c cong . 
Only extended patterns (Fig. [3) and moving localized patterns (isolated stop-and- 
go waves, Fig. are relevant for our investigation. 

Most extended traffic patterns exhibit internal oscillations propagating upstream 
approximately at the same chara cteristic velocity c « c cong — const (Daganzo, 
20021 ; iMauch and Cassidv , 2002 ). In spatiotemporal representations such as that 
of Fig.|4]this corresponds to nearly parallel structures. For free traffic, the prop- 
agation velocity c is somewhat lower than the average vehicle speed V. 
The amplitude of the oscillation s increases while propagating upstream (IBertini and Leal 
2005t lAhn and Cassidvl 12007). We quantify this by defining a temporal growth 
rate cr characterizing the growth process before saturation. 
The average wavel ength L of the ( saturated) w aves increases with decreasing 
bottleneck strength (ITreiber et al. , 2000, 2010a). 



In formulating the calibration criteria, We use the first two facts to filter the input 
data for applicable spatiotemporal regions (Sec. I2.21 > while the quantities c, cr, and L 
quantifying the remaining stylized facts constitute the criteria itself (Sections 12.4112. 61 . 
In general, c, cr, and L depend on the bottleneck strength which, therefore, needs to be 
quantitatively defined as well (Sec. 12.31 ). 



2.1. Data Description 

We assume that aggregated data for the flow Q and speed V (arithmetic average) are 
available from several detector cross sections at locations i. The distance between two 
adjacent detectors should not exceed about 2.5 km which generally is satisfied on rele- 
vant freeway sections (i.e., sections where traffic breakdowns are observed regularly). 
Usually, the flow and speed data {Vu,, Qm) of cross section i are available for each lane 
(index after each time interval (index t). With few exceptions, congested traffic is 
"synchronized" between lanes, i.e., the average speed at a given location and time is 
not significantly different across the lanes. We therefore use the following weighted 
averages as input for our calibration criteria: 

V it = ±-Y J Qu t V i n (1) 
i=i 

where n, is the number of mainroad lanes at location i. The total flow is given by 

Qi, = £ Qm. (2) 
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2.2. Filtering Spatiotemporal Regions 

In order to determine the dynamic quantities c, <x, and L from the input data {V,,} it 
is necessary to restrict the spatiotemporal range to regions of actual congestions. From 
the stylized facts above, we use the information that the downstream boundary of con- 
gestions is either stationary at a bottleneck, or moving with a constant velocity c cong . 
Furthermore, internal structures move at nearly the same velocity. This leads to defin- 
ing parallelogram-shaped regions for both isolated stop-and-go waves, and extended 
congestions (cf. Figs.[4]and[3]). The parallelogram is defined as follows: 

• Two edges correspond to detector locations. The downstream edge is defined 
by the detector that is nearest to the bottleneck (in upstream direction). The 
upstream edge is at the detector location where the waves begin to saturate. This 
should include n > 3 detector cross sections. In the following, we will count the 
cross sections in the direction of the flow, i.e., the upstream edge is located at x\, 
and the downstream edge at x n . 

• The other two edges are angled such that, in the spatiotemporal picture, the waves 
propagate parallel to these edges. 

• The vertices {x\, fj 6g ) and {x\, t™ A ) connected by the upstream edge are defined 
such that the parallelogram has a maximum area subject to the condition that 
there is no free traffic inside: 

t h t es = maxfc-^il (3) 



c, 



coir: 



u cong 



fl end = min k-?!—*. \ : =t* s + T. (4) 



Here, is the time where the aggregated speed time series V„ of cross section 
i drops below a critical speed v c , and f, is the time where it rises above v c . To 
reduce the occurrence of false signals, the time series can be smoothed over time 
scales of a few minutes just for this step. A value v c = 70km/h turned out to 
give good results for freeways. 

2.3. Quantifying the Bottleneck Strength 

For extended congestions, the quantities c, cr, and L depend on the bottleneck 
strength. Therefore, this exogenous variable has to be characterized by the measured 
quantities as well. Theoretically, the bottleneck strength is defined by a local dro p of the 



capacity that is available for the mainroad traffic dSchonhof and Helbing, 120071) . How- 
ever, this quantity is problematic to measure, particularly, if the bottleneck includes 
merging/diverging traffic (e.g., junctions, intersections, or lane closings). Therefore, 
we use as proxy the average vehicle speed at the detector nearest to the bottleneck, 

t=t, 8 +Ar 
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Figure 3: Example of stationary detector data displaying the development of stop-and-go waves from station- 
ary congested traffic near the bottleneck, (a) Recons truction of the spatiote mporal evolution of local vehicle 
speeds using a dedicated reconstruction technique i Treiber et al., 2010b). Also shown is a reconstructed 
vehicle trajectory (black curve), (b) Speed-time series of the reconstructed trajectory, (c) Speed-time series 
according to Eq. (TJ. The locations of the respective detectors are given by the black baselines, and the 
speed (reference value 90km/h) is plotted relative to the baseline. The black parallelogram encloses the 
relevant spatiotemporal range defined in Sec. 12.21 (d) Representation of the same data with a skewed time 
axis f = t— (x — *i)/c c ong with x\ = 473 km and c cone = -16 km/h. 



At this location, the oscillations inside extended jams are minimal (Stylized Fact 4) 
making V a well-defined quantity. In order to plot the characteristic quantities of iso- 
lated stop-and-go waves into the same diagram, such as Fig.|5j we define V as the speed 
outside of the wave, for this case. 

2.4. Propagation Velocity 

The propagation velocity c can be calculated by maximizing the sum of cross- 
correlation functions of speed time series of detector pairs [i, j] in the congested region 



(at positions x, and x,, re spectively), with respect to the velocity c ( Coifman and Wana, 



2005; Zielke et al., 2008) 



arg max ^ ^ Corr V,(f), V) \t + *' * j \ 

i j>i 



(6) 



where the continuous-in-time function V,(f) is defined by a piecewise linear interpola- 
tion of the detector time series V,-,. Stylized Fact 3 suggests that the values of c lie in 
a very small range which is confirmed by Fig. EJd). This makes it possible to define 
the spatiotemporal region using the a-priori estimate c cong without danger of circular 
reasoning. When selecting spatiotemporal regions of free traffic, Eq. © can be used 
to determine the propagation velocity of perturbations in free traffic (data points at the 
upper right corner of Fig. [2d)). 
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2.5. Growth rate 

The average spatial growth rate & of perturbations is defined in terms of the slope 
of the linear regression of the logarithm of the amplitude, 

2,x,ln|A,| -nx\n\Ai\ 
XjiXf -nx 

where the amplitude A, is approximated by the standard deviation of the speed time 
series V„ inside the parallelogram. Of course, & is only a rough approximation of the 
true growth rate. Particularly, & is influenced by non-collective fluctuations caused by 
random noise and driver-vehicle heterogeneity as well as by saturation effects, partic- 
ularly, if the spatiotemporal region is not defined carefully. Both effects systematically 
decrease the estimated growth rate but do not invalidate the overall picture. However, it 
is essential to calculate the simulated quantities using exactly the same measuring and 
estimation procedures. 

Once the spatial growth rate is known, the temporal growth rate is given by 

a- = efr. (8) 

Notice that, in general, <x is negative, and <x positive. 



2.6. "Wavelength 

The average period t of the oscillations of extended congestions is given by the 
position of the first nontrivial peak of the autocorrelation functions of detector time 
series inside the parallelogram. In some cases, waves merge during their propagation 
such that the period and the wavelength increases with the distance from the bottleneck. 
We therefore define t at a certain development stage, namely at the limit of saturation: 

r = arg max Corr [Vi (f), Vj (t + r')] . (9) 

T'>() 

In analogy to Eq. ([SJ, the associated wavelength is given by 

L = \c\t. (10) 



3. Application to Freeway Data 

The characteristic quantities defined in the previous section have been calculated 
for about 30 instances of traffic congestions observed on a section of the German free- 
way A5-South near Frankfurt for April and May 2001. Images of the corresponding 
spatiotemporal speed fields can be viewed at www.traffic-states.com (cf. Fig. [2] 
for a screenshot). Not all of the more than 60 jams observed in the southern direction 
in this time period are suited for the analysis: Localized jams must be moving (filter 
criterion "MLC"), and extended jams must exhibit distinct oscillating structures (filter 
criteria "OCT" or "TSG"). Additionally, they must be sufficiently extended such that 
the parallelogram includes at least three cross sections, and each cross section records 
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Figure 4: Stationary detector data displaying the evolution of isolated stop-and-go waves caused by a weak 
bottleneck located at about 487.5 km. (a) The spatiotemporal speed field, (b) the time series, and (c) the 
time series with skewed time axis are derived from the data as in Fig. [3] The parameters of the skewed time 
transformation t' = t - (x - x\ )/c cong are given by X\ = 483 km and c cong = — 16 km/h. 
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Figure 5 : Characteristic properties of the propagation of perturbations in congested traffic, (a) Growth rate 
a", (b) wavelength L, and (c) propagation velocity c, as a function of the average speed V characterizing the 
bottleneck strength. 
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Figure 6: Speed-time series of several detectors on the US American freeway OR 217 iZielke et al.Ll2008l) . 
Shown are deviations (oscillating curves) from the average speed (horizontal baselines). The detector spac- 
ings are proportional to the spacings of the baselines. Also shown is the time period 1/cr in which the 
amplitudes of each wave increase by a factor of e. 

at least three oscillations. While oscillating structures are observed for most of the 
cases except for very strong or weak bottlenecks (filter criteria "HCT" and "HST", re- 
spectively), nearly 50 % of the remaining candidates do not satisfy the minimum size 
criteria. 

Figure [5] displays the main result which can be used not only for calibration pur- 
poses, but also for understanding the traffic dynamics in general. Part (a) of this figure 
shows the growth rate of the moving structures inside extended congestions (points 
for V below 70 km/h), the growth rate of isolated stop-and-go waves (MLC, top right 
corner), and, for comparison, the growth rate of some moving structures in free traffic. 
For congested traffic, the growth rate is generally positive, apart for very low average 
velocities V corresponding to extended congestions behind strong bottlenecks (usually 
accident-caused lane closures, selected in the database by the criterium "accident"). 
This positive growth rate can be directly experienced by the driver (or alternatively ob- 
tained from floating-car data) as shown in Fig. Ob): When approaching the bottleneck , 
the experienced velocity oscillations apparently decrease in amplitude with the growth 
rate 

o"fc = Vfcit. (11) 

They have the apparent period 

TFC = T (^U) (12) 
\\c\ + V FC ) 

where V F c is the average speed of the floating car. 

Figure|3b) shows that the wavelength decreases with increasing bottleneck strength, 
but always remains above 1 km. Finally, Fig [2c) confirms that the propagation veloc- 
ity is nearly constant (in the range -17 km/h to -16 km/h for extendedjams and about 
-18 km/h for isolated moving waves). 

The analysis quantitatively confirms the stylized facts presented in Sec. [2] Par- 
ticularly, there is no empirical evidence against the existence of linear instabilities in 
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Figure 7: Examples of simulated virtual detector data based on simulations with the Intelligent Driver Model 
for a permanent bottleneck (plots a and b), and the Human Driver Model with a temporary bottleneck (c, d). 
The simulated data aggregation (time intervals of 1 min), and the transformations to display the data are the 
same as for the empirical data in Fig. [5] 



congested traffic. This is also confirmed by data from other freeways in several coun- 
tries, although there are quantitative deviations ( this, exactly, i s the b asis for calibrating 
the models). As an example, Fig.[6]taken from lZielke et al.1 d2008l) shows speed time 
series which are suitable for the present analysis. The growth rate is comparable to the 
A5 dat a while the propag ation velocity c « -20 km/h has a somewhat higher absolute 



value (Zielke et al., 2008) . 



4. A First Test of Traffic Flow Models 

In this section, we will show how the results can be applied to calibrating the dy- 
namic aspects of tra ffic flow models by testing the Intellig ent-Driver Model (ID M) 
dTreiber et all EoOO) and the Human Driver Model (HDM) jTreiber et all l2006al) on 



the extended congestion shown in Fig. [3] Both models are time-continuous car-following 
models, but the calibration procedures can be applied as well to discrete-in-time mod- 
els, cellular automata and to macroscopic models. 

In the simulations, the bottleneck is represented by an on-ramp. The on-ramp flow, 
i.e., the bottleneck strength, is chosen such that the average velocity V near the bot- 
tleneck, Eq. [5] is consistent with the data. At t = 70 min, the bottleneck is reduced 
resulting in a moving downstream front of the congestion at later times (this has no in- 
fluence on the calibration results). Virtual detectors are positioned every kilometer, and 
vehicle passage times and speeds are aggregated in one-minute intervals, in analogy to 
the data sampling at the real detectors. The result is displayed in Fig. [7] 
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Figure 8: A moving bottleneck filtered from the image database www.traf f ic-states . com: Validating a 
model with data it most probably has never seen before. 

The criteria for the actual calibration and assessment are the propagation velocity 
c, wavelength L (or period f), and growth rate <x. They are calculated from the virtual 
detector data with exactly the same scheme that has been used for the real data. Since 
the speed V, i.e., the bottleneck strength, is fixed, there are no degrees of freedom by 
manipulating the bottleneck. A comparison with the empirical values cr ODS = 5h _1 , 
Cobs = -17 km/h, and r b s = lOmin (cf. Fig. 01 gives a first impression of the dis- 
criminative power of the proposed calibration and testing method: The empirical prop- 
agation velocity c agrees with both models. However, for the IDM simulation, the 
period tidm = 6min of the waves is too short, and the growth rate ct ID m ~ lOfr 1 
(calculated using the detectors at 0, 1, and 2 km) is too high. It turns out that this 
cannot be improved significantly by changing the model parameters, i.e., calibrating 
the model. For the HDM simulation, the period thdm = 8 min deviates less, and the 
growth rate ct HD m ~ 5 fr 1 is consistent with the data. Since, in contrast to the IDM, the 
HDM includes explicit reaction times and multi-anticipation (the drivers take into ac- 
count multiple leaders), this may mean that both aspects are essential for a quantitative 
modelling. However, it is too early to draw definitive conclusions. 

5. Discussion and Outlook 

In conclusion, we propose three aspects of traffic instabilities that are accessible to 
a systematic and quantitative test of model prediction: the wavelength, the propagation 
velocity, and the growth rate. Since all properties are functions of the average velocity 
indicating the bottleneck strength, there should be enough input for a discriminative 
testing of models. All quantities are defined in terms of automated schemes enabling 
the systematic investigation of large-scale data bases. As a first step, it is planned to 
investigate the complete database displayed at www. traffic-states . com containing 
more than 400 instances of congestions. 

The database contains also some unusual patterns to which the scheme of Sec. [2] 
can be easily generalized and which may be used to validate the predictive power of 
models calibrated to the more conventional patterns discussed above. For example, 
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there are some searchable instances of "moving bottlenecks" (Pig. [8j, possibly caused 
by very slow special transports or obstructions due to "moving road works" (such as 
mowing the median lawn). 

We notice that all proposed quantities for calibrating and testing relate to existing 
congestions. Due to the stochastic nature of the perturbations eventually leading to a 
breakdown, predictions for an actual breakdown are nece ssarily restricted to a proba- 



bilistic level (Elefteriad ou et all 1 1995: Bril on et all 12005) 



It must be emphasized that calibrations based on microscopic quantities such as 
the difference between measured and simula ted gaps are strongly influenced by un 



predi ctable intra- and inter-driver variations (Ossen et all 1 2007: Resting and Treiber. 



2008 ) resulting in little discrim i native power (jBrockfeld et all 12004 ; Raniitkar et al 



trongi 

ioo3 

HIE 



2004 : Ossen and Hoogendoorri 2005 : Punzo and SimoneM I2Q05H . However, when 



calibrating against integrated quantities such as the traveling time ( Br ockfeld et al 



120031) . important aspects such as accelerations or the properties of oscillations are av- 
eraged away resulting in reduced discriminative power as well. In contrast, the schemes 
proposed here probe the traffic dynamics on the intermediate scale of individual oscil- 
lations which is of the order of one kilometer and a few minutes. Since we probe 
on collective phenomena, inter- and intra-driver variabilities should only play a minor 
role. However, the scale is sufficiently microscopic to retain accelerations. We there- 
fore expect that the proposed schemes can be applied to a better and more discrim- 
inative benchmarking of microscopic and macroscopic models with respect to traffic 
instabilities. This is the topic of ongoing work. 
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